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LOG- STRUCTURED ARRAY 

Technical Field o£ the Invention 

The present invention relates generally to a method and system of 
adding storage devices to a log-structured array storage system. 

Background of the Invention 

Many current mid to high-end data processing systems employ storage 
systems that comprise arrays of small form factor direct access storage 
devices such as magnetic disk drives . Such arrays are designed to emulate 
large form factor devices while offering the advantages of lower cost, 
smaller size, improved performance and reduced power consumption. A number 
of alternative array architectures have been developed to achieve good 
overall array reliability; one of which is RAID (Redundant Array of 
Independent Disks) defined by the RAID advisory board and which includes a 
number of variants (commonly called "levels') . One commonly used variant is 
RAID 5 in which a data stripe consists of a number of data strips and a 
parity strip, each strip being stored on one member disk of the array. In 
RAID 5 the parity strips of different data stripes are distributed across 
the member disks. Further details of RAID 5 and the other variants may be 
found in the * RAID book' { ISBN 1-573 9 8-02 8-5) . 

A more recently developed array architecture is the Log Structured 

Array (LSA) in which the Log Structured Filesystem (LSF) approach, 

pioneered at University of California Berkeley, is combined with RAID. 

Examples of systems employing the LSA architecture are described in US 

Patents 5,124,987 and 5,671,390. LSA is most often described as being 
implemented using RAID 5 techniques. 

An LSA consists of a disk controller and N+l physical disks. In an 
LSA, data is stored on disks in compressed form. After a piece of data is 
updated, it may not compress to the same size as it did before it was 
updated, so it may not fit back into the space that had been allocated for 
it before the update. The implication is that there can no longer be 
fixed, static locations for all the data. An LSA controller manages 
information storage to write updated data into new disk locations rather 
than writing new data in place. Therefore, the LSA must keep a directory 
which it uses to locate data items in the array. 

As an illustration of the N+l physical disks, an LSA can be 
considered as consisting of a group of disk drive DASDs , each of which 
includes multiple disk platters stacked into a column. Each disk is 
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divided into large consecutive areas called s egment -columns . A 
segment -column is typically as large as a physical cylinder on a physical 
disk. Corresponding segment -columns from the N+l disks constitute a 
segment. The array has as many segments as there are segment -columns on a 
5 disk in the array. One of the segment -columns of a segment contains the 

parity { exclusive-OR) of the remaining segment -columns of the segment. For 
performance reasons, the parity segment -columns are not all on the same 
disk, but are rotated among the disks. 

10 Logical devices are mapped and stored in the LSA . A logical track is 

stored, as a set of compressed records, entirely within some segment of the 
LSA; many logical tracks can be stored in the same segment. The location 
of a logical track in an LSA changes over time. A directory, called the 
LSA directory, indicates the current location of each logical track. The 

15 LSA directory is maintained either in Non-Volatile Storage (NVS) or paged 

virtual memory in the disk controller. 

Reading and writing into an LSA occurs under management of the LSA 
controller. An LSA controller can include resident microcode that emulates 

20 logical devices such as direct access storage device (DASD) disk drives, or 

tape drives. - In this way, the physical nature of the external storage 
subsystem can be transparent to the operating system and to the 
applications executing on the computer processor accessing the LSA. Thus, 
read and write commands sent by the computer processor to the external 

2 5 information storage system would be interpreted by the LSA controller and 

mapped to the appropriate disk storage locations in a manner not known to 
the computer processor. This comprises a mapping of the LSA logical 
devices onto the actual disks of the LSA. 



3 0 A write received from the host system is first written into a 

non-volatile cache and the host is immediately notified that the write is 
done. The fraction of cache occupied by modified logical tracks is 
monitored by the controller. When this fraction exceeds some threshold, 
some number of modified logical tracks are moved (logically) to a memory 

35 segment, from where they get written (destaged) to disk. The memory 

segment is a section of controller memory, logically organized as N+l 
segment -columns called memory segment-columns; N data memory 
segment -columns and 1 parity memory segment -column. When all or part of a 
logical track is selected from the NVS, the entire logical track is written 

40 into one of the N data memory s egment - co lumns . When all data memory 

segment-columns are full, an XOR operation is applied to all the data 
memory segment -columns to create the parity memory segment -column, then all 
N+l memory segment -columns are written to an empty segment on the disk 
array. It is important for the performance of LSA that a segment is an 
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integral number of RAID 5 stripes so that parity can be calculated over the 
destaged data without reading the old data and parity and so incurring the 
RAID 5 write penalty. 

All logical tracks that were just written to disk from the memory 
segment must have their entries in the LSA directory updated to reflect 
their new disk locations. If these logical tracks had been written before 
by the system, the LSA directory would have contained their previous 
physical disk locations; otherwise the LSA directory would have indicated 
that the logical track had never been written, so has no address. Note 
that writing to the disk is more efficient in LSA than in RAID-5, where 4 
disk accesses are needed for an update. 

In Log Structured Arrays, data to be written is grouped together inco 
relatively large blocks (the segments) which are written out as a unit in a 
convenient free segment location on disk. When data is written, the 
previous disk locations of the data become free creating w holes" of unused 
data (or garbage) in the segments on disk. Eventually the disk fills up 
with segments and it is necessary to create free segment locations by 
reading source segments with holes and compacting their still-in-use 
content into a lesser number of destination segments without holes. This 
process is called free space or garbage collection. 

When the user of a non-LSA RAID array wishes to expand the storage 
capacity of the array, one option is to add one or more new member disk 
drives to the array. In RAID 5 arrays, the problem arises as to how to 
remap the data and/or parity whilst adding one or more disks. Furthermore a 
number of characteristics are important for any RAID array expansion scheme 
: (i) host IOs should not be held off for long periods of time; (ii) the 
amount of data/parity that needs to be moved should be minimised; and (iii) 
the translation of logical address to physical address should be kept as 
simple as possible since this calculation must be performed for every array 
IO access. A number of patents including US patents 5502836, 5524204, 
5615352 and 5758118 address the aforementioned problem and seek to trade 
off these desirable characteristics. 

Expansion capability for log structured arrays is also desirable. One 
relatively straight forward way of doing this would be to add a complete 
new RAID array to the log-structured array, as a log structured array may 
consist of one or more RAID arrays. However, the LSA user may not need all 
the additional storage capacity provided by an additional RAID array and 
therefore it would often be preferable to increase storage capacity by 
adding one or more member disk drives to a RAID array which is already a 
member of an LSA. However, a number of additional problems arise in adding 
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new members to a RAID array that forms part of an -LSA, none of which are 
addressed in the prior art. These are: (i) LSA performance is tied to the 
ability to perform full stripe writes for destaging segment sized IOs to 
the array. Thus the logical block addresses of the blocks in a stripe after 
the array expansion must be sequential; (ii) since the LSA segment size is 
tied to the array stripe size, if this changes, the segment size for new 
destages must change to match; and (iii) since LSA obtains new segments for 
filling by employing a garbage collection algorithm over previously 
destaged segments, it is convenient that newly destaged segments are the 
same size as pre-existing segments. 

It would be desirable to provide an improved expansion/contraction 
scheme for an information storage system configured as a log structured 
array. 

Summary of the Invention 

According to a first aspect of the invention there is provided a log 
structured array (LSA) controller apparatus for controlling the transfer of 
information between a processor and a plurality of information storage 
devices configured as an N+l array in which the information is stored as 
stripes extending across the devices of the array, each stripe comprising N 
information strips and one parity strip, each information strip storing an 
integer number of logical tracks; the controller further defining an LSA 
directory which specifies the location of each logical track in terms of 
the ID of the stripe to which the logical track belongs and the offset of 
the logical track within the stripe; wherein on the addition of an 
information storage device to the array, the additional strip provided for 
each stripe by the storage device is logically appended to the end of each 
stripe in the LSA directory. 

According to second aspect of the invention there is a provided a 
method of adding an information storage device to a plurality of 
information storage devices in an information processing system in which a 
processor is connected for communication with the information storage 
devices by means of a log structured array controller, the plurality of 
information storage devices being configured as an N+l array in which the 
information is stored as stripes extending across the devices of the array, 
each stripe comprising N information strips and one parity strip, each 
information strip storing an integer number of logical tracks, the 
controller further defining an LSA directory which specifies the location 
of each logical track in terms of the ID of the stripe to which the logical 
track belongs and the offset of the logical track within the stripe, the 
method comprising connecting the additional information storage device to 
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the log-structured array controller and logically appending the additional 
strip, provided to each existing stripe by the additional storage device, 
to the end of each stripe in the LSA directory. 

The present invention is thus predicated on the LSA directory 
defining the location of a logical track within the array using the 
construct {stripe number , of f set } rather than the traditional construct 
(logical block address} . In this way, the additional strip provided by the 
additional storage device for each existing stripe is logically appended to 
the end of the existing stripes and the addresses previously stored in the 
LSA are still correct. 



In a preferred embodiment, for array configurations where prior to 
the addition of the additional storage device the parity strips are rotated 

15 amongst the N+l information storage devices in accordance with the RAID-5 

architecture, the method further comprises moving selected parity strips to 
the additional information storage device at locations that would have 
stored parity strips had the array originally comprised N+2 information 
storage devices. In this way skew in the load on the storage devices Is 

2 0 avoided. 



25 



30 



A preferred embodiment of the present invention will now be 
described, by way of example only, with reference to the accompanying 
drawings . 

Brief Description of the Drawings 

Figure 1 is a representation of a computer system constructed in 
accordance with an embodiment of the present invention; 

Figure 2 is a tabular representation of an array of 3+P disk storage 
devices prior to the addition of the extra disk storage device; 

Figure 3 is a tabular representation of an expanded array prior to 
3 5 transfer of information to the extra disk storage device; and 

Figure 4 is a tabular representation of an expanded array in which 
parity strips have been moved to the additional disk storage device in 
accordance with one embodiment of the present invention. 

40 



Detailed Description of the Preferred Embodiment 
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Figure 1 shows a preferred embodiment of a computer system 100 
constructed in accordance with the present invention. The system 100 
includes a processor 102 or host computer that communicates with an 
external information storage system 104 having N+l direct access storage 
devices { DASD) in the form of disks in which information is maintained as a 
log structured array (LSA) . In accordance with the RAID 5 architecture, 
each disk is logically divided into large consecutive areas called 
segment-columns where a segment -column is typically as large as a physical 
cylinder on a physical disk. Corresponding segment-columns from the N+l 
disks constitute a segment. The array has as many segments as there are 
segment -columns on a disk in the array. One of the segment- columns of a 
segment contains the parity (exclusive-OR) of the remaining segment -columns 
of the segment. For performance reasons, x he parity segment -columns are 
not all on the same disk, but are rotated among the disks. 

In Figure 1, an array 106 comprising four disks 106a, 106b, 106c, and 
106d is shown for illustration, but it should be understood that the array 
may include a greater or lesser number of disks. A control unit 108 
controls the storage of information so that the array 106 is maintained as 
an LSA. Thus, the DASD recording area is divided into multiple 
segment -column areas and all like segment -columns from all the disks 
comprise one segment's worth of data. The control unit 108 manages the 
transfer of data to and from the array 106 so that periodically it 
considers segments for free space and selects target segments according to 
a fitness function described in detail below. 

The processor 102 includes (not illustrated) : one or more central 
processor units, such as a microprocessor, to execute programming 
instructions; random access memory (RAM) to contain application program 
instructions, system program instructions, and data; and an input/output 
controller to respond to read and write requests from executing 
applications. The processor 102 may be coupled to local DASD (not 
illustrated) in addition to being coupled to the LSA 104. Typically, an 
application program executing in the processor 102 may generate a request 
to read or write data, which causes the operating system of the processor 
to issue a read or write request, respectively, to the LSA control unit 
108. 

When the processor 102 issues a read or write request, the request is 
sent from the processor to the control unit 108 over a data bus 110 and is 
received in the control unit by a controller 112. In response, the 
controller produces control signals and provides them over a controller 
data path 114 to an LSA directory 116 and thereby determines where in the 
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LSA the data is located, either in a non-volatile LSA data cache 118 or in 
the DASD 106. The LSA controller 112 comprises one or more microprocessors 
with sufficient RAM to store programming instructions for interpreting read 
and write requests and for managing the LSA 104 in accordance with the 
present invention. 

Data is transferred between the processor 102 and the LSA 104 during 
read operations over a path including a read data path 120, DASD access 
circuits 122, the LSA data cache 118, controller access circuits 124, the 
controller data path 114, the controller 112, and the data bus 110. Data 
is transferred during write operations over a path including the data bus 
110, the controller 112, the controller data path 114, the controller 
access circuits 124,. the LSA data cache 118, the DASD access circuits 122, 
a segment data path 126, an accumulating memory segment input write buffer 
128, and a DASD write path 130. 

The data cache 118 permits delay of write operations on modified data 
logical tracks to the memory segment 128 for purposes of maintaining seek 
affinity. More particularly, if write operations to adjacent logical 
tracks are received, then all modified data in logically adjacent tracks 
will be moved into the memory segment 128 at the same time so they are 
stored in the same segment-column. This helps keep together logical tracks 
that are adjacent in the data cache so they will be adjacent when moved 
into the DASD array 106, thereby preserving seek affinity. The advantages 
and operation of the data cache 118 are described in greater detail in U.S. 
Pat. No. 5,551,003 issued Aug. 27, 1996 and assigned to International 
Business Machines Corporation. 

Preferably, the LSA data cache 118 is managed as a 
least-recently-used (LRU) cache, so that data is queued in the cache, with 
the most recently stored data at the top (or front) of the queue. In 
particular, the LSA data cache 118 is organized with clean data tracks in 
one LRU list and dirty tracks in another LRU list. The clean LRU list 
specifies logical tracks containing information wherein the data in the LSA 
cache is the same as the data in the DASD array, and the dirty LRU list 
specifies logical tracks containing modified data wherein data is different 
from the data in the DASD array. 

A basic operation of the storage system 104 is to write a particular 
logical track so as to change the contents of the logical track. In 
general, such live data tracks are first placed in the non-volatile data 
cache memory 118 of the LSA control unit 108. When the fraction of the 
cache occupied by modified logical tracks exceeds a predetermined value, 
the controller 112 logically moves a set number of modified. tracks to the 
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memory segment 128 by assigning them there. After one segment's worth of 
live tracks are moved into the memory segment, the tracks are written into 
contiguous locations of the DASD array 106. It should be understood that 
the operation of the data cache 118 is transparent to the processor 102 and 
therefore some operations of the storage system 104 will be described from 
the perspective of the processor, without reference to the data cache. 
Although the inclusion of a data cache 118. as described above can improve 
the overall performance of an LSA system, it should be understood that the 
inclusion of a data cache and the details of its implementation are not 
essential to the invention. 

The smallest unit of data that can be addressed by the LSA directory 
116 is called a logical track. If the processor 102 writes data comprising 
part of a track, the LSA control unit 10S ...ast read the remainder of the 
logical track from the DASD array 106 making reference to the current 
location of the logical track stored in the LSA directory 116 before 
writing the updated complete track into the memory segment buffer 128. A 
variable number of compressed logical cracks comprise a segment (depending 
upon how well the logical tracks compress) . At any time, a logical track 
is live, or current, in only one segment. In all other segments, the 
logical track is outdated, also referred to as being a dead track. From 
the perspective of the processor 102, a live data track is initially stored 
into controller memory (such as the data cache 118 or the input' memory 
segment write buffer 12 8) comprising a segment SEG 0 that initially is empty. 
That is, the segment SEG 0 resides in the controller memory as the segment is 
filled. 

If a logical track k is being written into the segment SEG 0 of 
controller memory and if the logical track k was previously live in some 
other DASD segment SEG in the DASD 106 before the write operation, then the 
track k becomes dead in the segment SEG and becomes live in the controller 
segment SEG C being filled. This continues until the segment SEG 0 in the LSA 
controller memory is filled to capacity, at which time the segment SEG 0 is 
destaged, i.e. it is moved from the memory segment buffer 128 and written 
to the DASD array 106. Another segment's worth of data is then filled in 
the controller memory and the process repeats until the next destage 
operation. In practice for performance reasons this logic may be duplicated 
resulting in multiple memory segment buffers 128 being filled concurrently. 
This concurrency does not change the basic principle of operation. 

As discussed above, it may become desirable to increase the overall 
storage capacity of the LSA. The present invention provides a technique 
whereby one or more DASDs can be added to the array. In the following 
description there will be detailed the process for adding a single 
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additional DASD to an existing array. However the techniques are 
applicable to the task of adding more than one new disk at a time. 

In the present invention, the LSA directory describes the location .of 
a track within a specific RAID array by using the construct {stripe number, 
offset} rather than simply {logical block address}. To use the technique 
and system of the present invention, it is required to employ an 
arrangement wherein a LSA segment maps to exactly one RAID stripe. Thus, a 
RAID strip and an LSA Segment column are the same thing and the block 
offset into a segment is the same as the block offset into the RAID stripe. 
It can be shown that this is the optimal arrangement for LSA in any case. 

These two predicates allow for the size of a segment to increase and 
potentially decrease (see the following discussion) without affecting the 
contents of the existing LSA directory. 

With reference now to Fig. 2 there is shown in tabular form a 3 + P 
RAID 5 array with a strip size of S logical blocks in which the data strips 
are numbered DO Dl D2 etc. and the parity strips are numbered P0 PI P2 etc. 
In this arrangement, each data strip starts at an array LB A given by its 
strip number (DO, Dl etc.) multiplied by S. Thus data strip D2 starts at 
LBA 2*S. 



It will be noted in passing that many RAID 5 implementations rotate 
the first strip in a stripe around the disks in the same way that that the 
25 parity rotates. This is usually done to improve the performance of 

sequential reads. Since sequential reads at the disk level are rare in LSA, 
this feature is not as attractive. The arrangement shown above was chosen 
for LSA in the present invention because it reduces the number of 
data/parity moves when a disk is added. 

30 

In the current invention, the procedure for adding a disk to the RAID 
array is as follows: 

1. The new disk is initialised to all binary 00s so that it can be included 
3 5 in the parity calculations without modifying the parity already on disk. 

2 . Accesses to the RAID array are temporarily suspended and any data cached 
by RAID 5 array location is flushed from the cache. 



3. The new disk (disk 4) is added as a member of the RAID array. The 
arrangement at this point in the process is as shown in Fig. 3. 



4. There are now several algorithms that can be applied to optionally 
relocate the parity and/or the data. 
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In the preferred embodiment, the data and parity are migrated from 
the arrangement of Fig. 2 to the arrangement of Fig. 3 such that the data and 
parity are moved to the position that they would have occupied had the 
array originally been created with 4 disks. In the preferred embodiment 
this migration happens gradually, alongside and as part of the normal 10 
workload of the disks with little impact. The algorithm for this task is as 
follows : 

A bitmap is initialised which has a single bit to represent each 
stripe in the array. The bit value indicates whether the data and parity in 
the strip are in the position shown in Fig. 2 or the position shown in Fig. 
3. 

At this point, 10 accesses to the array can be enabled. The algorithm 
for servicing them is as follows: 

Reads/Cache stages: If the strip being accessed is one for which the 
position of the data is different in Fig. 3 from the position in Fig. 2 
then the bitmap is examined in order to determine which disk to access. 

Destage operations are much simpler since these always occur as full 
stripe writes. In this case the data and parity are written at the position 
shown in Fig. 3 and the bitmap is updated if necessary to show that the 
data and parity have been moved. 

In an alternative arrangement, a background process is set up to 
migrate the data and parity from Fig. 2 format to Fig. 3 format. This is 
not preferred since it is much easier to migrate the data and parity 
between the formats during a full stripe write (no locking or reading needs 
to be performed) and because the skewing of the load on the disk has only 
very slight impact for the read only workload which is applied to the disks 
between segment destages in L3A . 

If the data and parity is relocated by a background process then it 
should be noted that the amount of parity and data which need to be moved 
is much less than (approximately half) that which has to be moved in 
schemes of the prior art which maintain optimal parity rotation and keep 
full strips sequential on disk. For example, in stripe 0 no movement at all 
needs to take place since the position of the data strips has not changed 
and we pre-initialised disk 4 with zeros so it can be used as the parity. 
In stripe 1 only PI and D5 need to be swapped, leaving D3 and D4 where they 
were . 
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The benefit of moving the parity to the Fig. 3 format is that the 
optimal rotation of parity on the disks is maintained. This is traded off 
against the cost of relocating the parity. 

In an alternative embodiment the parity is not moved, rather it is 
maintained in the format shown in Fig. 2. The benefit of this approach is 
that no parity movement must take place but this is at the cost of slightly 
skewing the load on the disks. A second alternative would be to swap the 
location of a parity strip and a single data strip on each stripe to 
achieve the optimal rotation of parity on the disks, but without keeping a 
sequential ordering of data strips across the disks. This would further 
reduce the amount of parity and data which need to be moved but would make 
the calculation to determine the location of data strips more complex. 

5. After adding the new drive to the array, each stripe now has an unused 
strip which is located logically as the last strip in the stripe. 

The appending of an extra strip onto the end of each stripe would not 
be viable if the addresses in the LSA directory were based upon logical 
block address. The addition of an extra strip into each stripe has changed 
the logical block address of every block of data after the first stripe in 
the array. Since in accordance with the preferred embodiment, the address 
stored in the LSA directory is of the form stripe {number , of f set } and the 
new strip is logically appended at the end of the existing stripes, the 
addresses stored in the LSA directory are still correct. 

All that has happened therefore is that each pre-existing segment has 
been expanded by the addition of some unused space at the end of the 
segment. This newly added space will become available for new destages as 
segments are garbage collected during the normal operation of the LSA. 

It is also possible to run this procedure in reverse to remove a disk 
from a RAID array LSA garbage collection is used to shrink every 
segment on the RAID array such that it no longer occupies the last strip in 
the stripe. As each segment is written back to the array, the strip on disk 
4 is filled with zeros and the parity Is written in the format shown in 



Fig. 2. 
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CLAIMS 



1. A log structured array (LSA) controller apparatus for controlling the 
transfer of information between a processor and a plurality of information 

5 storage devices configured as an N+l array in which the information is 

stored as stripes extending across the devices of the array, each stripe 
comprising N information strips and one parity strip, each information 
strip storing an integer number of logical tracks; the controller further 
defining an LSA directory which specifies the location of each logical 
10 track in terms of the ID of the stripe to which the track belongs and the 

offset of the track within the stripe; wherein on the addition of an 
information storage device to the array, the additional strip provided for 
each stripe by the storage device is logically appended to the end of each 
stripe in the LSA directory. 

15 

2. A method of adding an information storage device to a plurality of 
information storage devices in an information processing system in which a 
processor is connected for communication with the information storage 
devices by means of a log structured array controller, the plurality of 

2 0 information storage devices being configured as an N+l array in which the 

information is stored as stripes extending across the devices of the array, 
each stripe comprising N information strips and one parity strip, each 
information strip storing an integer number of logical tracks, the 
controller further defining an LSA directory' which specifies -the" location 

2 5 of each logical track in terms of the ID of the stripe to which the track 

belongs and . the offset of the track within the stripe, the method 
comprising connecting the additional information storage device to the 
log-structured array controller and logically appending the additional 
strip, provided to each existing stripe by the additional storage device, 

3 0 to the end of each stripe in the LSA directory. 

3. A method as claimed in claim 2, wherein prior to the addition of the 
additional storage device, the parity strips are rotated amongst the N+l 
information storage devices in accordance with the RAID-5 architecture, the 

3 5 method comprising moving selected parity strips to the additional 

information storage device at locations that would have stored parity 
strips had the array originally comprised N+2 information storage devices. 



40 



4. A method as claimed in claim 3, wherein the data and parity strips 
are moved to the additional storage device during normal 10 operations to 
the devices . 
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5 . 



A method as claimed in claim 3, wherein a background task is defined 
by the controller to move the data and parity strips to the additional 
storage device. 



A method as claimed in claim 4, wherein a bitmap is defined by the 
controller, each bit of the bitmap representing an array stripe and 
indicating whether the data and parity strips of the stripe are located in 
thexr onginal position or in the position appropriate to the expanded 
array. 

7. A method as claimed in any of claims 2 to 6 , wherein the additional 
information storage device is initialised to all binary zeros prior to 
connection to the controller. 

8. A method as claimed in any of claims 2 to 7 , comprising connecting a 
plurality of additional information storage devices to the log-structured 
array controller and logically appending the additional strips, provided to 
each existing stripe by the additional storage devices, to the end of each 
stripe in the LSA directory. 
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ABSTRACT 



LOG- STRUCTURED ARRAY 



A log structured array (LSA) controller apparatus controls the 
transfer of information between a processor and a plurality of information 
storage devices configured as an N+l array in which the information is 
stored as stripes extending across the devices of the array, each stripe 
comprising N information strips and one parity strip, each information 
strip storing an integer number of logical tracks. The controller defines 
an LSA directory that specifies the location of each logical track in terms 
of the ID of the stripe to which the:track belongs and the offset of the 
track within the stripe; wherein on the addition of an information storage 
device to the array, the additional strip provided for each stripe by the 
storage device is logically appended to the end of each stripe in the LSA 
directory. 
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